Set Items Description

S1 158787 S (E OR ELECTRONIC) () (DOCUMENT?? OR MAIL?? OR PAPER?? OR THESES
OR DISSERTATION? ?) OR (INTERNET OR WEB) (1N) (PAGE?? OR BROWSER??)

S2 3487613 S DOMAIN?? OR HEADER?? OR DATA () FRAME?? OR HTTP OR HTML OR
JAVA () SCRIPT OR CODE?? OR CODING OR ENCODING OR TOP () LEVEL?? 
CHINESE OR FRENCH OR RUSSIAN OR SLAVIC OR ARABIC OR ARAMAIC OR HINDI

S4 241948 S (DETERMINE?? OR DETERMINING OR DETERMINATION OR ESTIMAT??? OR
EVALUAT??? OR APPROXIMAT??? OR PROBABILITY OR STATIST? OR AUTOMAT? OR DETECT???
OR COMPIL?) (10N) S3

S5 32538 S AU=(FRANZ, A? OR FRANZ A? OR MILCH, B? OR MILCH B? OR JACKSON,
E? OR JACKSON E? OR ZHOU, J? OR ZHOU J? OR DIAMENT, B? OR DIAMENT B?) 

S6 18442 S S1 AND S2

S7 423 S S6 AND S4

S8 327 RD (unique items)

S9 202 S S8 NOT PY>2003

S10 3 S S9 AND (WEIGHT??? OR SCORE OR RANK???)

S11 30 S S9 AND (FREQUENT OR FREQUENCY OR OCCURRENCE?? OR OFTEN OR
REPEAT??? OR REPETITIVE OR RECUR? OR NUMBER OR TIMES) 

S12 29 S S11 NOT S10

S13 0 S S12 AND (LANGUAGE (20N) (HEADER OR FRAME??))

S14 6 S S12 AND (PARSE?? OR PARSING OR PREFIX OR SUFFIXX OR SEGMENT???
OR GRAMMAR?? OR SEMANTIC?? OR SYNTAX)

Title: Speech input interface for Web page query based on a dynamic language model architecture
Abstract: Speech query is more natural and convenient than text in the Web page information retrieval. The 
severe problem for speech query is that there exist out-of-vocabulary words or unknown words in most of the 
Web pages and this induces the difficulty to recognize the speech from syllable lattice into the correct text. In
this paper, we propose a dynamic language model architecture that is a Web page based language model to
correctly convert speech into text. The proposed method extracts language information and builds a weighted
language model for each browsing page before the HTML document goes into the browser. In this manner, the
language information for the Web page including those out-of-vocabulary words are well estimated in the
weighted language model. In recognizing the speech into text, the weighted language model is first looked up and
returns a weighting factor for the universal language model. Eventually, the syllable lattice of the recognized 
speech can be correctly decoded into text for text query. (Author abstract... 

computer environment, comprising the steps of: automatically determining the language and country of a Web site
visitor; directing a Web server to deliver the appropriate localized content contained in one or more country/language
databases and/or file systems to said visitor's browser; informing language in the requested document, wherein said
browser is allowed to download said font from said process; intercepting input text that is submitted using an HTML
form; writing said input text into a form database in a manner so that it is translated later, wherein said form database
includes information to identify the country, language and encoding of said text to interpret it for subsequent
translation; identifying content that needs to be translated and staging the content for offline translation by at least
one manual translation resource and at least one automatic translation resource; dynamically routing and sequencing
said content to the at least one manual translation resource and at least one automatic translation resource, wherein said
routing and sequencing is performed according to any of: the subject matter of the document to be processed, target
language of the translations and whether draft-only or high quality is required; and providing a database viewer which
allows the translated content to be viewed in the context of the form in which it was originally entered.